1 



METHOD AND APPARATUS FOR MEASURING OPTICAL OVERLAY DEVIATION 

BACKGROUND OF THE INVENTION 

1. Field of Invention 

[0001] The present invention relates generally to optical inspection of semiconductor 
substrates during the production of semiconductor devices. In particular, the present 
invention relates to methods and apparatus for detecting the overlay deviation between two 
patterned layers on a semiconductor substrate. 

2. Description of Related Art 

[0002] One of the most critical process control techniques used in the manufacturing 
of integrated circuits is the measurement of overlay accuracy between patterned layers, i.e., 
the measurement of how accurately a patterned layer aligns with respect to another patterned 
layer located (directly or indirectly) above or below that layer. 

[0003] Overlay measurement typically is performed on a patterned resist layer prior 
to etching that layer. If a substrate contains layers that are out of alignment, then the substrate 
should be reworked (i.e., the photoresist layer(s) are stripped) and returned to the 
photolithography process to be exposed again at a corrected alignment. Typically, the overlay 
error is measured, and then the results of the measurement are fed back into the control system 
that controls the alignment of layers. 

[0004] Measurement patterns, formed in each of the patterned layers, are used to 
measure the overlay deviation. The most commonly used measurement patterns are squares, 
generally referred to as "bar-in-bar" marks, an example of which is shown in Fig. 2A. An 
outer bar mark 5 1 is printed on a process layer (lower layer) with side-to-side dimensions of 
approximately 30 micrometers, and an inner bar mark 52 is printed on a photoresist layer 
(upper layer) with a side-to-side dimension of approximately 15 micrometers. The outer bar 
mark 51 and the inner bar mark 52 are detected by a microscope system having a CCD 
camera, for example. 

[0005] The overlay deviation of the photoresist layer in terms of the process layer is 
a positional deviation between the outer bar mark 51 and the inner bar mark 52. Hereafter, 
the overlay deviation will be referred to as an "overlay-offset." Fig. 2B illustrates a CCD 
output signal waveform taken along the line 53 shown in Fig. 2A. A signal 54L, a signal 54R, 
a signal 55L and a signal 55R correspond to the signals emitted from points 51L, 51R, 52L 
and 52R, respectively. A median point 54C is calculated from the signals 54L and 54R. 
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Similarly, a median point 55C is calculated from the signals 55L and 55R. The overlay-offset 
in the X-direction is determined by calculating the difference between the median point 54C 
and the median point 55C. This same method is used to determine the overlay-offset in the 
Y-direction. A zero overlay-offset at all alignment marks on the substrate (the substrate can 
be, for example, a silicon wafer, a glass or quartz substrate, or a substrate made from 
semiconductor materials other than silicon) means that the photoresist layer is exactly aligned 
with the (underlying) process layer. In this case, the substrate is forwarded to the next 
manufacturing process without any reworking being done. 

[0006] However, even if the measured overlay-offset is 0, a displacement error 
between the two layers may exist. Generally, there are two reasons for this (unmeasured) 
displacement error. The two reasons are tool induced shift (TIS) and wafer induced shift 
(WIS), to be explained below. 

[0007] Tool induced shift is an error in the measurement which is caused by the tool 
which performs the measurement. The tool includes the CCD, the optical system of the tool 
(which includes the illumination system for illuminating the substrate and the detection system 
optics by which the image of the substrate is observed by the CCD), and software that is used 
to control the system. TIS can be caused, for example, when the illumination light is not 
telecentric or when an optical axis of the illumination system and/or of the detection optical 
system is not exactly perpendicular to the substrate. TIS can be calculated by the following 
equation (1): 



TIS = (M(0)+M(180))/ 2 Equation (1) 

in which M(0) is the overlay-offset calculated at a 0° orientation of the alignment mark, and 
M(180) is the overlay-offset calculated when the substrate is rotated by 180° (known as the 
180° orientation of the alignment mark). In Fig. 2 A, the sheet surface is defined as an X-Y 
plane, and an axis perpendicular to the sheet is defined as the Z-axis. The overlay-offset is 
measured at 0 = 0° orientation of the substrate, and then the substrate (including the bar-in-bar 
mark) is rotated by 9 = 180° about the Z-axis so that overlay-offset at 0 = 180° can be 
measured. Thus, M(0) and M(l 80) can be measured in order to determine TIS. In order to 
minimize the displacement error, TIS should be compensated. An overall TIS corrected value 
(referred to as TCV (TIS corrected value)) is used as an overlay-offset value for an evaluation 
of the alignment. The following equation (2) shows how TCV is calculated: 
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TCV = M(0)-TIS = (M(0)-M(180))/ 2 



Equation (2) 



[0008] Wafer induced shift (i.e., errors in the measurement results due to wafer 
induced shift) occurs due to characteristics of the patterned layers formed on the substrate. 
One cause of WIS is an asymmetric (non-uniform) processing that is performed on the film 
layers formed on the substrate. For example, if a step that removes or polishes part of a layer 
from the substrate, or if a step that deposits a layer onto the substrate, is not performed 
uniformly oyer the entire substrate, it may cause WIS. For example, certain layers are made 
very thin by performing a chemical mechanical planarization (CMP) process on that layer in 
order to remove a part of the thickness of the layer. This CMP process typically is performed 
using a rotating disk that is contacted with the uppermost layer on the substrate in order to 
remove part of that layer. Since the speed of the disk at its outer perimeter is higher than the 
speed of the disk near its center, different effects can be caused on the substrate layer near the 
outer perimeter of the disk compared to the inner (center) portion of the disk. For example, 
the directions of throughholes can be become tilted (in the direction in which the disk rotates) 
as one progresses from the center of the substrate toward the outer perimeter thereof. In 
addition, during deposition, typically a source of the deposited material is located above the 
substrate at the location of the central axis of the substrate. The vaporized particles to be 
deposited on the substrate are emitted from this point source, and can become deposited 
unequally between the center and outer perimeter of the substrate. These asymmetric 
processing steps can cause WIS errors in the measurements made during the overlay 
measurement process. WIS should be compensated for as well as TIS. However, WIS 
compensation is very difficult and therefore is not done very frequently, as described below. 

[0009] Typically substrates are processed in batches (also called "lots") 
corresponding to the number of substrates that can be held in a cassette. A typical cassette 
capacity is 25 substrates. Overlay inspection typically is performed on three wafers from each 
lot. Thus, if a lot includes 25 substrates, the overlay inspection (including TIS evaluation) is 
conducted on, for example, the first, tenth and twentieth substrates of the lot. However, in 
order to measure WIS, a process referred to as after etching inspection (AEI) is performed. 
AEI is time consuming and involves viewing the substrate using, for example, a scanning 
electron microscope (SEM). AEI, however, damages the circuitry formed on the substrate, 
such that the measured substrate must be reworked. AEI is not a regular procedure, and 
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therefore typically is conducted once per day or once per week or once per month. It is not 
practical to perform AEI on each lot of substrates. 

[0010] Thus, WIS is not regularly evaluated. Solving issues related to WIS is more 
important than ever because, with increasingly smaller circuit patterns, process control 
technologies (such as overlay accuracy) become more and more critical. The required 
precision specification for overlay machines is about 2.5 nm for 70 nm technology node. 
Currently, the precision of overlay measurement, which defines measurement repeatability, is 1 
nm or less for the actual process substrate. The precision performance is sufficient for the 
current overlay specification requirements. On the other hand, the accuracy performance, i.e., 
how close the measurement is to a true overlay-offset without WIS compensation, is still 
over 5-10 nm. Therefore, it is desirable to measure WIS and compensate WIS values for the 
measured overlay-offset values, in order to improve. Thus, as noted above, WIS is 
periodically measured and compensated for. It would, however, be desirable to evaluate WIS 
more frequently without requiring frequent performance of AEI. 

SUMMARY OF THE INVENTION 

[0011] Accordingly, one aspect of the present invention is to provide apparatus and 
methods that can characterize and monitor WIS in real time in order to improve the overlay 
measurement accuracy. 

[0012] According to one aspect of the invention, an apparatus and method for 
measuring overlay deviation between two layers is capable of evaluating WIS in a simple 
manner without necessarily requiring AEI. A wafer having alignment marks provided in two 
layers is placed on a stage device that is movable in X, Y, Z and 0 directions. The substrate is 
provided in the X-Y plane, and the Z-axis is perpendicular to the substrate surface (i.e., the 
X-Y plane). The apparatus can measure TIS at substrate orientations of 0° and 180°, and 
calculate TCV for each alignment mark. The apparatus also measures TIS -through-focus (i.e., 
TIS at different focus positions) by moving the wafer along the Z-axis and calculating TIS at a 
plurality of Z-axis positions. TCV-through-focus (i.e., TCV values at the different focus 
positions) can be calculated for each alignment mark using the values obtained for TIS- 
through-focus. If the layers formed on the substrate suffer from WIS, the TCV values will 
change at different Z-axis positions (i.e., at different focus positions) of the substrate. The 
present invention, thus, can display a vector map of the TCV-through-focus values so that an 
observer of the vector map can determine whether the wafer suffers from WIS. 
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[0013] By monitoring (observing) the WIS characteristics of a wafer from each lot, 
one can readily observe whether the WIS characteristics of the wafers have changed. Thus, 
while the invention does not necessarily measure the WIS values, it enables one to monitor the 
WIS characteristics of wafers using existing components of the overlay measurement machine 
(i.e., components that are used to measure TIS). If the WIS characteristics change (for 
example, if the size of the vectors on the displayed vector-map change, for example, by 
increasing in length), then AEI can be performed in order to measure the WIS values and then 
compensate for the changes in WIS. 

[0014] According to one embodiment, the apparatus calculates d/dz TCV 
(differentiated TCV by z), and then plots the d/dz TCV vector for each alignment mark on a 
two dimensional map. The d/dz TCV vector is defined by two directional components, X 
and Y. The d/dz TCV vector-map that is displayed reflects WIS, e.g., if the d/dz TCV vector- 
map shows rotational direction, then the substrate can be characterized as suffering from 
rotational WIS. By comparing the d/dz TCV vector-map with a d/dz TIS vector-map, the 
apparatus and method can distinguish TIS from WIS. For example, when both the d/dz TIS 
vector-map and the d/dz TCV vector-map show a similar directional component, the wafer 
can be deemed to not suffer from WIS. However, when the d/dz TIS vector-map shows a 
directional component and the d/dz TCV vector-map shows a magnification component, the 
wafer can be deemed to suffer from magnification WIS. 

[0015] Another aspect of the invention relates to a method for monitoring 
fluctuation in WIS. Many processes need to be controlled in order to stably manufacture 
semiconductor devices. For example, the film stack thickness, deposition conditions, chemical 
mechanical planarization (CMP) conditions, etc., need to be controlled. However, unexpected 
changes can occur, for example, a wafer can be over CMP polished, or an efficiency of 
deposition can be reduced due to shrinkage (depletion) of a source of the deposited material 
over the vaporization-life of the material. WIS is periodically measured and compensated for 
to improve accuracy. Thus, such unexpected changes can be detected. If, on the other hand, 
such unexpected changes are not detected, the same compensation values will continue to be 
used up until the next WIS evaluation period, and therefore, an over-WIS-compensation or an 
under- WIS-compensation would occur. This leads to serious mis-alignment errors, which are 
out of the overlay budget. As noted above, the apparatus can monitor the lot-by-lot 
fluctuation of WIS by measuring d/dz TCV and d/dz TIS for each lot of wafers. Thus the 
period of time is reduced in which unexpected WIS changes remain uncompensated. 



[0016] Thus, an apparatus and method of the invention analyze overlay deviation in 
alignment between a first mark and a second mark that are formed on a substrate. In 
particular, a relationship between changes in overlay deviation values and changes in focus 
position of the substrate for a plurality of sets of the first and second marks that are provided 
on the substrate is calculated, and then an output is provided from which a user can determine 
whether the substrate suffers from wafer-induced-shift. Preferably, the output is a vector-map 
showing the relationship between changes in overlay deviation values and changes in focus 
position of the substrate for the plurality of sets of marks. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0017] The invention will be described with reference to the following drawings in 
which like reference numerals designate like parts, and wherein: 

[0018] Fig. 1 schematically shows a construction of an optical overlay deviation 
measuring apparatus according to one embodiment of the invention; 

[0019] Fig. 2A is a plan view showing an alignment mark; 

[0020] Fig. 2B is a graph showing the signal waveform output by the imaging device 
(e.g., a CCD) of the optical overlay deviation measuring apparatus; 

[0021] Fig. 3 A is a cross-sectional view showing the diffraction light emitted from 
an alignment mark; 

[0022] Fig. 3B is a cross-sectional view and a signal waveform graph for an ideal 
alignment mark; 

[0023] Fig. 3C is a cross-sectional view and a signal waveform graph for an 
alignment mark suffering from WIS; 

[0024] Fig. 4A is a graph showing TCV-through-focus; 
[0025] Fig. 4B is a graph showing TIS-through-focus; 

[0026] Fig. 5A is a vector-map and statistical analysis result showing overlay-offset; 
[0027] Fig. 5B is a vector-map and statistical analysis result showing TIS-through- 
focus; 

[0028] Fig. 5C is a vector-map and statistical analysis result showing TCV-through- 
focus; and 

[0029] Fig. 5D is a vector-map and statistical analysis result showing TCV-through- 
focus for another wafer of a different lot. 
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DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 
[0030] A preferred exemplary embodiment of the invention will be described with 
reference to the drawings. 

[0031] Fig. 1 shows one example of an apparatus for measuring an overlay-offset 
according to the present invention. A direction perpendicular to the sheet surface is set as the 
Y-axis direction, directions extending to the right and left are set as the X-axis direction, and 
directions extending up and down are set as the Z-axis directions. Rotation about the Z-axis is 
set as 6 in Fig. 1. 

[0032] The measuring apparatus measures overlay-offset, i.e., a positional deviation 
in alignment of an alignment mark 1 formed on a substrate such as wafer 2. As is known, 
there are many alignment marks la and lb on the wafer 2. The wafer 2 is placed on a stage 3 
during measurement of the overlay-offset. The stage 3 is movable in X and Y-directions, and 
is rotatable in the 0 direction (i.e., about the Z-axis). Thus, overlay-offset can be measured for 
a wafer at a 0° orientation and at a 1 80° orientation. 

[0033] An X-Y position-measuring device 4 measures the X-Y coordinates of the 
alignment mark 1, in the usual manner. 

[0034] The stage 3 also is movable up and down (i.e., movable along the Z-axis 
direction) by a focus-moving device 5. The focus-moving device 5 includes a linear scale 6 to 
measure the position of the wafer 2 in the Z-axis direction. The focus-moving device 5 can 
move the stage 3 with 0.01 |i,m accuracy over a range of 6 p,m (i.e., +3 [im from a zero- 
position and -3 p,m from the zero-position). 

[0035] The apparatus includes an illumination optical system 10 that includes a light 
source 1 1 (e.g., a halogen light source), a condenser lens 12 and a half prism 13. The 
collimated illumination beam is incident upon the half prism 13, and the light reflected by the 
half prism 13 exits downward and is converged by an objective lens 14. The converged 
illumination beam 15 falls vertically upon the alignment mark 1 disposed on the wafer 2. 

[0036] The apparatus also includes an image-forming optical system 20 that includes 
the objective lens 14 and an image-forming lens 21. A reflected beam 16 is led via the image- 
forming optical system 20 to an imaging device 22 (e.g., a CCD camera). The reflected 
beam 16 is collimated into a parallel beam by the objective lens 14 and, after passing through 
the half prism 13, is converged by the image-forming lens 21 onto the imaging device 22. 

[0037] An image-processing device 23 calculates the median point 54C and the 
median point 55C illustrated in Fig. 2B, and then calculates the overlay-offset in the 
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X-direction. A similar method is used to calculate the overlay-offset in the Y-direction. A 
half prism 24 is located between the half prism 13 and the image-forming lens 21 to separate a 
reflected beam 25 that exits the half prism 13. An automatic focusing unit 74 includes the half 
prism 24, a dual-wedge prism 26 and a linear CCD 27. A beam passed through the dual- 
wedge prism 26 is separated into two beams 28a and 28b. When the focus-moving device 5 
moves the stage 3 up or down, the parallelism of the reflected beam 25 changes and an 
angle 29 between the beams 28a and 28b becomes smaller or larger (narrower or wider) 
respectively. The linear CCD 27 measures the distance from the beam 28a to the beam 28b, 
and fixes an optimum Z-position as a zero-position (hereafter, the zero-position is referred to 
as AF - 0 position). 

[0038] In order to calculate TIS, the apparatus measures the overlay-offset of the 
alignment mark 1 at 0 = 0° orientation of the wafer 2, and then rotates the wafer 2 using the 
stage 3 to put the wafer at the 9 = 180° orientation, and then measures the overlay-offset of 
the alignment mark 1 at 0 = 180°. TIS and TCV in the X-direction are calculated using the 
overlay-offset value M(0) at 0° X-direction and the overlay offset value M(180) at the 180° 
X-direction orientations using equations (1) and (2), respectively. TIS and TCD in the 
Y-direction are obtained by a similar procedure. 

[0039] Fig. 3 A is a cross-sectional view of a substrate layer, and shows an alignment 
mark. In general, the diffraction light emitted from the alignment mark is affected by various 
parameters such as, for example, the wavelength X of the light, the refractive index n of the 
material, the diffraction angle (p, the step-height h of the mark, the width p of the mark, the 
tapered angles (|)1 and 4>2 of the mark, and the roundness rl and r2 of the bottom edges of the 
mark. The diffraction light is collimated by the objective lens 14, and imaged onto the imaging 
device 22. 

[0040] Fig. 3B illustrates a cross section on the line 53 (shown in Fig. 2A) and a 
CCD output signal waveform 56. An outer mark 61 is placed in a process layer 60 
(polysilicon, aluminum, etc.) and an inner mark 62 (photoresist layer) is placed onto the 
process layer 60. Ideally, both a left side outer mark 61L and a right side outer mark 61R are 
symmetric with respect to a perpendicular line 63L and 63R, so that a diffraction light 64L and 
a diffraction light 64R are symmetric with respect to the perpendicular lines 63L and 63R. 
Therefore, the overlay-offset can be measured accurately, and does not contain any WIS. 
Also, a median point 56C (calculated from the signals 56L and 56R) does not change when the 
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focus-moving device 5 moves the wafer up or down along the Z-axis. That is, the overlay- 
offset is stable through-focus. 

[0041] In an actual case, however, for example after CMP processing, the cross- 
sectional shape becomes tilted as shown in Fig. 3C. A diffraction light 66L emitted from a 
tilted mark 65L and a diffraction light 66R emitted from a tilted mark 65R are asymmetric 
with respect to the perpendicular lines 67L and 67R. In addition, a wavefront 68L of the 
diffraction light 66L, and a wavefront 68R of the diffraction light 66R are eccentric. 
Therefore, a signal 57L and a signal 57R of the CCD output signal waveform 57 shift from the 
ideal signal positions 56L and 56R (the ideal waveform is shown by the dotted line in Fig. 3C). 
Accordingly, a median point 57C shifts from the ideal median point 56C, i.e., the overlay- 
offset cannot be measured with high accuracy due to WIS. In addition, the median point 57C 
changes when the focus-moving device 5 moves the wafer 2 up or down along the Z-axis. 
That is, the overlay-offset is not stable through-focus. 

[0042] Fig. 4A shows the TCV-through-focus data for one alignment mark. In 
Fig. 4A, the horizontal axis shows a focusing position (the value 0 is the AF = 0 position), and 
the vertical axis shows TCV. Fig. 4B shows the TIS-through-focus data. Comparing Fig. 4A 
with Fig. 4B, TCV changes through-focus, although TIS appears to be stable through-focus. 
Therefore, this wafer is characterized to suffer from WIS. 

[0043] The invention makes use of this observation that a variation of the 
TCV-through-focus reflects the existence of WIS. The invention also can observe variations 
of the TCV-through-focus depending on the location of the alignment mark on the substrate. 
This is due to the fact that CMP and the deposition techniques can vary over the wafer, 
thereby causing WIS. 

[0044] According to the invention, it is possible to analyze a vector-map of 
TCV-through-focus. In order to analyze the vector-map of TCV-through-focus d/dz TCV is 
determined and evaluated. The d/dz TCV is a differential value of TCV over changes in the 
focus position Z, and is provided by the following equation (3): 

d/dz TCV = (TCV (z2)-TCV (zl)) / (z2-zl) Equation (3) 



in which TCV (zl) and TCV (z2) are the values of TCV at z = zl and TCV at z = z2, 
respectively. The d/dz TCV vector is defined by the following four values: d/dz TCV in the 
X-direction, d/dz TCV in the Y-direction, and coordinate values x and y. The d/dz TCV 
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vector-map can be determined and drawn/displayed for a number of alignment mark locations 
on the substrate. 

[0045] Referring back to Fig. 1, which shows an exemplary embodiment of the 
invention, the overlay-offset values calculated by the image processing device 23 are input to a 
TIS calculation device 30. The TIS calculation device 30 calculates TIS values for both the 
X- and Y-directions using equation (1) discussed above. A TCV calculation device (an 
overlay calculation device) 3 1 calculates TCV values for both the X- and Y-directions using 
equation (2) described above. 

[0046] Then, a d/dz TIS calculation device (TIS-through-focus calculation device) 
32 calculates a value of TIS differentiated over Z (i.e., change in TIS with change in focus 
position), based on the TIS information and the Z-positional information obtained from the 
linear scale 6 for different Z-positions. A d/dz TCV calculation device (an overlay-through- 
focus calculation device) 33 calculates a value of TCV differentiated over Z (i.e., a change in 
TCV with changes in the focus position), using equation (3), described above, based on the 
TCV information and the Z-positional information obtained from the linear scale 6 for 
different Z-positions. 

[0047] A d/dz TIS vector-map analysis device 34, a d/dz TCV vector-map analysis 
device 35, and a TCV vector-map analysis device 36 analyze the d/dz TIS vector trend, the 
d/dz TCV vector trend, and the TCV vector trend, respectively. In addition, the d/dz TIS 
vector-map analysis device 34, the d/dz TCV vector-map analysis device 35, and the TCV 
vector-map analysis device 36 calculate some statistical parameters simultaneously. A display 
device 37 displays the vector maps and the results of the statistical analyses, as shown in 
Figs. 5A-5C. These analyses and calculations are performed based on the coordinate values X 
and Y output by the X-Y positioning device 4. In particular, the TCV vector-map analysis 
device 36 calculates statistical parameters 70 (see Fig. 5 A) using the same calculation 
procedure as in existing measurement apparatus, which solves parametric equations in terms 
of the parameters 70, the overlay-offset values, and the coordinate values X and Y. The 
statistical parameters 70 are fed back as the alignment error values in the photolithography 
process. Existing measurement apparatus, which only determine TIS and TCV at a single 
focus position (i.e., AF=0), determine and output a vector-map showing overlay-offset (i.e., 
TCV) and statistical parameters. Even if statistical parameters are only four parameters, as 
shown in Figs. 5B-5C, the same simultaneous equations are used, substituting zero-value for 
the other four parameters. Thus, the vector-maps for TIS-through-focus and for TCV- 
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through-focus can be determined using the existing hardware and software for vector-map 
generation, with different data (i.e., the data of the calculated TIS-through-focus and of the 
calculated TCV-through-focus, described herein) being provided to that vector-map 
generation hardware and software. 

[0048] Figs. 5A-5C show a TCV (overlay-offset) vector-map, a d/dz TIS 
(TIS-through-focus) vector map and a d/dz TCV (TCV-through-focus) vector-map, 
respectively, for a wafer processed according to a particular process (referred to as process 
(A)) displayed on the display device 37. The display device 37 also displays the results of 
parameters that were analyzed statistically, to the right side of the vector-map. Note that the 
dimensions of the statistical parameters of d/dz TIS and d/dz TCV are the original dimensions 
divided by micrometer, e.g., translation: (j,m/p,m rotation: p,rad/^im. The d/dz TIS and the d/dz 
TCV information reflects sensitivity to a change in focus as given by equation (3). 

[0049] A higher NA (numerical aperture) of the objective lens 14 is not preferred, 
because it may be too sensitive for WIS, and may reduce the overlay measurement 
performance for a thick alignment mark. According to one preferred embodiment, NA of the 
objective lens 14 was 0.5. 

[0050] Comparing Fig. 5B (TIS-through-focus) with Fig. 5C (TCV-through-focus), 
it can be seen that the d/dz TIS vectors of Fig. 5B show a negligible translational trend, but 
that the d/dz TCV vectors of Fig. 5C are much larger and show a rotational trend (in the 
counter-clockwise direction) and a magnification trend (i.e., the vectors are smaller toward the 
center of the wafer), and that the rotational component is larger than the magnification 
component. Therefore, the wafer that was processed according to process (A) is deemed to 
suffer from WIS, which includes a dominant rotational component and a smaller grid- 
magnification component. A user of the optical overlay deviation measurement apparatus can 
reach these conclusions by observing the displayed vector-maps of Figs. 5B and 5C. It is easy 
to characterize WIS due to the different trends observed by comparing the d/dz TIS vector- 
map with the d/dz TCV vector-map. Typically, if TIS-through-focus is not stable, TIS 
changes linearly independent of an alignment mark location on the substrate because TIS is 
caused when the illumination light is not telecentric or when the optical system is not 
perpendicular to the substrate. In such a case, the d/dz TIS vector-map shows a translational 
trend but not a magnification or rotational trend. On the other hand, in the presence of WIS, 
the d/dz TCV vector-map of WIS would show a magnification trend or a rotational trend, but 
rarely shows a translational trend. 
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[0051] In order to achieve higher WIS characterization performance, a preferred 
embodiment includes a comparison device 80, shown schematically in Fig. 1 . The comparison 
device 80 compares statistical parameters 71 of the d/dz TIS vector-map with statistical 
parameters 72 of the d/dz TCV vector-map. According to the results for some actual wafers, 
the apparatus of the present invention can measure about 1 nm difference by a 1 um changing 
of the Z-position, i.e., it can measure d/dz TIS value or d/dz TCV value = O.OOlum/um. 
Therefore, the comparison device 80 determines a vector trend based on a criteria, based on 
whether the statistical parameter is more then 0.001 or not, at a first step. For instance, in the 
statistical parameters 71 shown in Fig. 5B, the d/dz TIS vector-map shows translation trend 
due to only translation X > 0.001. In the statistical parameters 72 showing in Fig. 5C, the 
d/dz TCV vector-map shows grid magnification and grid rotation trend due to both 
parameters > 0.001. The comparison device 80 compares the results of each trend, i.e., 
translation trend of d/dz TIS with magnification and rotation trend of d/dz TCV, at the second 
step. Thus, the comparison device 80 determines whether the substrate suffers from WIS 
automatically. 

[0052] According to the results for some actual process wafers analyzed by the 
present invention, it has been determined that rotational WIS exists in wafers mostly after they 
have been subjected to a CMP process, and that magnification WIS exists mostly in wafers 
after they have been subjected to a deposition process. A process condition could change 
unexpectedly, e.g., an over-polished CMP and a deposition target lifetime may change. The 
present invention provides a monitoring method to monitor WIS fluctuation caused by such 
process condition changes. By monitoring the wafers on a lot-by-lot basis, it is possible to 
observe that the d/dz TCV vectors increase in size, as shown in Fig. 5D, even if there is no 
change in the d/dz TIS vector-map. Actually, statistical parameters 73, i.e., grid magnification 
and grid rotation shown in Fig. 5D are twice larger than the statistical parameters 72. This 
means that a WIS trend has changed, and therefore it is advisable to measure the WIS value 
(for example, using AEI) so as to change the process conditions. 

[0053] The following describes one example of application of the invention to a 
high-volume manufacturing line. As noted above, it is typical to measure TIS, and thus 
determine TCV for three wafers in each lot (for example, the first, tenth and twentieth wafers). 
According to one example of use of the invention, WIS is characterized for one wafer in each 
lot. For example, one of the wafers that is normally subjected to TIS measurement is 
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measured at different focus positions in order to obtain the data that is used to calculate TIS- 
through-focus and TCV-through-focus. An example of the procedure is as follows: 

1) Measure the first wafer of the three wafers that are to be measured in a lot at two 
Z-positions, for example, z=0 and z=+1.0|am, by changing the Z-position of the wafer using 
the focus-moving device 5. 

2) Measure the second and third wafers that are to be measured in the lot (as noted 
above, these actually may be the tenth and twentieth wafers in the lot) only at the z=0 position. 

3) The TCV vector-map analysis device 36 analyzes the TCV vector trends and 
calculates the statistical parameters at the z=0 position for each of the three measured wafers. 
The display device 37 displays the vector-maps and the results of the statistical analysis for the 
three wafers, as is conventional. 

4) The d/dz TIS calculation device 32 subtracts TIS (z=0) from TIS (z=+lfim) for 
only the first measured wafer. The d/dz TCV calculation device 33 subtracts TCV (z=0) from 
TCV (z=+1.0|am) for only the first measured wafer. The display device 37 displays the 
vector-maps for TIS-through focus and TCV-through-focus and the results of the statistical 
analyses for the first measured wafer. 

Accordingly, a d/dz TCV vector-map is determined and displayed for one wafer in each lot so 
that an operator can monitor WIS on a lot-by-lot basis. 

[0054] The differential values often are small (for example, 5nm difference between 
z=0 and z=+1.0|j,m). However, the apparatus of the invention can measure this small 
difference because the precision performance is less than lnm. Thus, according to steps 
(l)-(4), WIS monitoring is available concurrently with the regular overlay measurement, 
without requiring any additional structure other than the calculation devices. 

[0055] As mentioned previously, the invention is applicable to the measurement of 
various types of substrates, not only wafers. The substrate can be a silicon wafer, a 
semiconductor wafer made from a material other than silicon, a glass substrate or a quartz 
substrate, for example. 

[0056] In the illustrated embodiment, the various calculation devices (elements 23, 
30-36 and 80 (if provided)) preferably are implemented using a suitably programmed general 
purpose computer, e.g., a microprocessor, microcontroller or other processor device (CPU or 
MPU). It will be appreciated by those skilled in the art, that the calculation devices also can 
be implemented as a single special purpose integrated circuit (e.g., ASIC) having a main or 
central processor section for overall, system-level control, and separate sections dedicated to 
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performing various different specific computations, functions and other processes under 
control of the central processor section. The calculation devices (which also can be referred 
to as a controller) also can be implemented using a plurality of separate dedicated or 
programmable integrated or other electronic circuits or devices (e.g., hardwired electronic or 
logic circuits such as discrete element circuits, or programmable logic devices such as PLDs, 
PL As, PALs or the like). The controller also can be implemented using a suitably 
programmed general purpose computer in conjunction with one or more peripheral (e.g., 
integrated circuit) data and signal processing devices. In general, any device or assembly of 
devices on which a finite state machine capable of implementing the described procedures can 
be used as the calculation devices/controller of the invention. 

[0057] While the invention has been described with reference to preferred 
embodiments thereof, it is to be understood that the invention is not limited to the preferred 
embodiments or constructions. To the contrary, the invention is intended to cover various 
modifications and equivalent arrangements. In addition, while the various elements of the 
preferred embodiments are shown in various combinations and configurations, which are 
exemplary, other combinations and configurations, including more, less or only a single 
element, are also within the spirit and scope of the invention. 



